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CGGCTAAGAT GCTGAAGAGG TAGGAACTAG AGGATGCAGA ATCACTTTAC 26800 
TTTTCTTCTT TTTCCTTTTG A G AC AG AG T C TCACTCTGTC AGCCAGACTG 26850 
GAGTGCAGTG GTACAATCAT GGCTCACTGC AACTTCGACC TCCCAGGCTC 26900 
AAGCAATCCT CCCATCTCAG TCCCACAAAT AGCTGGGACT ACAGGTGCAC 2 6950 
ATCACCACAC CTGGCTACTT TAAAAAAATT TTTTTGTAGA GATGGGGTCT 27000 
CCCTGTGTTG CCCAGGCTGG TCTCTTGAAT TCCTGTGCTC AAGCCATCCT 27050 
TCCACCTCAG CCTCCCAGAG TGCCAGGATT ACAGGCATGA GCCACCACAC 27100 
CCAGCCACCA CTTTTCTTAA AAAAAAAAAA AGATTCTCTC TGGTAGACAA 27150 
TCCTCAATAG TCCACATGTT ATTAAACAAT CTGCTGCCTG AATACATGAT 27200 
TTACCAAAAA AAGGAAATTT TGACGGGTTC AGAATATCAA GGGATCTGAG 27250 
GCAAATGTCA CCTATGATAA AATTTGCTAT CAAAATTAGG AAGTTTGTGT 27300 
TTACCTGATC CTAAAGCAGT AACCAGCCCA TTTCTAGGGA ATAAAACTCT 27350 
CATGCGTATA TTGTGCATAT ATATGTATTA TATGACTGAG TGATAATAAA 27400 
ATTTTTTTTC TAGCTTCCTG AAGGCTGGTG GAGAAGTGAT TGATTCAGTT 27450 
ACATGGCATC AGTAAGTATG TCTCCTATTC TTAATACTAG GAAAGTAAGG 27500 
CTAGCTTTAT TTATTACCTA GTATTCAAAA AGTTAGTTCA TTTAACTGCC 27550 
AATTGACTGC AGTTCAAATA AGAAACAAAT AGTGTCTCAA GTAGCACTGT 27600 
ACTCCAATTT TAATATTAAT AAAAAAAATT TTAAGTTATT TTAAATAATG 27650 
TAGTGGTTTC TATAAAGATC ACTTTATACA GAAGAACAGT GCCAATTAAC 277 00 
CCATGGAACA TATAAGTAGC TAAAACCAAT TGCTTGCCAA AGAACCAGTA 27750 
ACCCAGGAGT ACATGTCCTT GCCACTGTGT TTTTTCAAGA CAGAGTAACT 27800 
GATTTCTAGT TACTTGCATA GAATGGACTC CTCCTCATAA CTCCCTTCCA 278 50 
TCTTGGTCTT TCCCTAGTAG AACTTCTACC TTTTTTTAGT AACAGGTGAG 27900 
TGGGAGAGGT AAGAAGGAGA ATAAGGTCAG CAATTAACCT AAAAGCAGAA 27950 
AGTAAAATTT GTTATTTTTT TTCTGAATAT TTTCTGTGTA ATTTAGCTAC 28000 
TATTTGAATG GACGGACTGC TACCAGGGAA GATTTTCTAA ACCCTGATGT 28050 
ATTGGACATT TTTATTTCAT CTGTGCAAAA AGTTTTCCAG GTAATAGTCT 28100 
TTTTAAACTT TTTAATGTAA AACCAGAATC CTTATTTTAT AGTCTAGCTA 28150 
GTTCTAAATT CTATAGGTAT GTATATTTAC ATGTTTTTCT AATTTTAGAG 282 00 
AACAAGCACT ATGACTTATC CACTGTTAGT TTTCCCCTTA GCATTGGGTC 28250 
TTACCCCATG TACGTGATTA GAAATTTGAA ATATTTCCAA TAGCCTTTAG 28 300 
TAGAATTAAC TCACATAGAT GATAAGAATG GGTTGGTTCA CTTCATGTTC 28350 
CTTCCACAGC CTACTATTTC AATAAAAGAA AGTTTCCCAA GACCTAAATG 28400 
ACTATGAACA TATTTTATAA CTATATAGGA GGGGTGGGTC TAGGAATACA 28 450 
AAGTTTTGAA TGCTGTTAAT CTTCAACACC ACAGTTGAAA CCACAGGTCA 28500 
GCTTTTTTGC AATTACCATG GATACTTTTC TGTTCTATAG GTGGTTGAGA 28550 
GCACCAGGCC TGGCAAGAAG GTCTGGTTAG GAGAAACAAG CTCTGCATAT 28 600 
GGAGGCGGAG CGCCCTTGCT ATCCGACACC TTTGCAGCTG GCTTTATGTG 28650 
AGTGAAGCAG CGCTGGCCTT AGGGGTCAGA GTGCAGCTCT TCTCCATCCT 28700 
TCTATTCTGC TGAAATAGCT CCCCAGCCAA AAAGCAGATC AAAGACCGTT 28750 
TCAGTGGCTG AGCCCCAAAA TTCATGCCAG ATTTTGCAAG AAAATGATTT 28800 
ACTAAAGCTT GAGGGAC AT C TTTAACAAGT GTTCCAAATT AATCACTATA 28 850 
AGGATGAATT GTTTCAGAAA TTTTGGCCTT TAATTATGGC CCATAAATAT 2 8 900 
GTCAAGTAGT CCTTACTCTA AAGAAGTACA CTGTAAAAGA ATGCATATAG 28 950 
CCGGATATGG TAGTTCCCTG TAATCCCAAT ACTTTGGGAG GCCAAGGTGG 29000 
GAGGATTGCT TGAGCCCAGG AGTTTGAGGC TGCAGTGAGT TATGATGGTG 29050 
CCACTGCACT CTAGACTGGG CAACAGAGTG AGACTGTCTT TTTTTTTCCC 2 9100 
CTCTGTCACC CAGACTGGAG GGCAGTGGCA CGATCTCACC TCACTGCAAC 29150 
CTCTGCCTCC CGGAT TGAAG CGATTCTCCT GCCTCAGCGT CCTGAGTAGC 29200 
TGGGACTACA GGAGTATCAC CGCACTGGGC TAATTTTTGT ATTTTTAGTA 29250 
GAGACGGGGT TTTGACATGT TGCCCAGGCT GGTCTGAAAC CCATGAGCTC 29300 
AAGTGATCTG CCTACCTCAG CCTTCCAAAA TGCTGGGATT ACGGACATGA 29350 
GCTACCACGC CCGGCCACAC CCTGTCTCTT AAAAAAAAAA AAAATGCAAG 29400 
TTAGAGCATA TTACAGCTTT GTCTCTCAGG AGGATACTTA GTGTATGTAG 29450 
CTATAATTCA TAGATTCCCA AGAAGTTTAG AGCCTAAAGT ATGAGGTCCC 2 9500 
ACCAGAGGGG CTATCATTAA ATTTAAAGAT TTGTTAAATC ATCTCATTGT 2 9550 
CCAACACCAC AAACTTGATT GCTTTAAAAT ACTGGTTTAG TTACATTTAG 29600 
TAACTCTATT AGTGCTTTTA ATCTATACTG CTATATCCTC ACATTGAGAT 29650 
TTTTTTTCTT TTCTCTTCCA TCTTCATTCT TTTTTCTCTC ATCCTCATTC 29700 
TTATAAGCCT AGAATACATC ACAAATCCTT TATGCCCATG GAAGCAAGAG 29750 
GAATAAAGAA TGGAGATGTT TGTTTTGCCA TTAACTAAAG ATCTGGGGTG 29800 
TCGGGGAGAA GGGGGATAGA GAAGGAGAAG TGGGAAGAGG TGTCCATAAT 29850 
AGCTTAGGTG CAATFCTGCT TATTTTACAT TTTACCCCCG CTGACTGCCA 29900 
CTTTTTCTTC AGCCCTCACA CATTGTTTGT GCAGGGACCT CATAGGACCA 29950 
GGAATTGTCT ATAGAGGTGG GAATTTGTCT CACCCTGAAA GGGATACCTC 30000 
TAGCATGGTA ATAGTCTTCT AGGATTTGTT ATCATATGGA AAGATGTAAA 30050 
GGGAGGGATT CTGCTGCTGC TGCTGCTGCT GCATGCAGTT GCCATTTCAT 30100 
TTAAATGACT TATTTATAAT TGATGACACT TTTCTGGCTT CCTGTTAATT 30150 
CCTCCCTCAA AGATCAATAA ACCAGAACCA GGCATGGTGG CATGCACTTG 30200 
TGGTCCTGTA ACCACCCAAC AGGTTCACCT TGCCTGCTGT CTAGATAGAG 30250 
CCAATTATCA AGACAGGGGA ATTGCAAAGG AGAAAGAGTA ATTTATGCAG 30300 
AGCCAGCTGT GCAGGAGACC AGAGTTTTAT TATTACTCAA ATCAGTCTCC 30350 
CCGAACATTC GAGGATCAGA GCTTTTAAGG ATAATTTGGC CGGTAGGGGC 30400 
TTAGGAAGTG GAGAGTGCTG GTTGGTCAGG TTGGAGATGG AATCACAGGG 30450 
AGTGGAAGTG AGGTTTTCTT GCTGTCTTCT GTTCCTGGAT GGGATGGCAG 30500 
AACTGGTTGG GCCAGATTAC CGGTCTGGGT GGTCTCAAAT GATCCACCCA 30550 
GTTCAGGGTC TGCAAGATAT CTCAAGCACT GATCTTAGGT TTTACAACAG 30600 
TGATGTTATC CCCAGGAACA ATTTGGGGAG GTTCAGACTC TTGGAGCCAG 30 650 
AGGCTGCATT ATCCCTAAAC CGTAATCTCT AATGTTGTAG CTAATTTGTT 30700 
AGTCCTGCAA AGGTAGACTT GTCCCCAGGC AAGAAGGGGG TCTTTTCAGA 30750 
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AAAGGGCTAT TATCATTTTT GTTTCAGAGT CAAACCATGA ACTGAATTTC 30800 

TTCCCAAAGT TAGTTCAGCC TACACCCAGG AATGAAGAAG GACAGCTTAA 30850 

AGGTTAGAAG CAAGATGGAG TCAATGAGGT CTGATCTCTT TCACTGTCAT 30900 

AATTTCCTCA GTTATAATTT TTGCAAAGGC GGTTTCAGTC CCAGCTACTT 30950 

GGGAGGCTGA GACAGGAGGA TTAATGGAGC CCAGGAGTTT GAGGTTGCAG 31000 

AGAGCTATGA TCACGCCACT GCACTCCAGC CTGGGTGACA GAGTGAGACC 31050 

CTGTCTCTAA ATAAATAAAT AAGTAAATAA ATAAATACAT AAATAAAATC 31100 

AAGATGGTGT GCAATTAGAA TTGAGCGATT TTGTTTCCAA ACCTCAAGAA 31150 

AGCTTGGTCT TGCTCTGTCC CAGGTGGCTG GATAAATTGG GCCTGTCAGC 31200 

CCGAATGGGA ATAGAAGTGG TGATGAGGCA AGTATTCTTT GGAGC AG GAA 31250 

ACTACCATTT AGTGGATGAA AACTTCGATC CTTTACCTGT AAGTGACCAT 31300 

TATTTTCCTA ATTCTAGTGG AGTAGATTAA AGTCAACTCA GGACCTCTGG 31350 

TGTTAACCTC CTATGAACAG TCAGTCCTCT CAGTAACTAG CCAAATCATG 31400 

AG AT GAT GAA TTAGAAGGAG CCTTAGATAG CATCCAATCT AACATTTTTT 31450 

TGTGTGTT-TG AAGAGAAGAA ATCAAGAGCT AGGAATAACT TTTTAAAGGT 31500 

AAGCCATTTG CAGTATAGTG TGGATTTTGT TTAAAAGGGG ATAATTTGAA 31550 

ATTTTATGAC TCATTATACA AGACAAAATA AGTTGGATTT TCAAATGTTT 31600 

TACAAAGTAA AT CAAAGTTA TAATTGCCTA CAGTACGCAA AGCTTCAAAA 31650 

CATTTTTTAT GTTATGAAAT TGTAATTTAT TTAACCTTAA AATGAGCCAG 31700 

TACCATGTGT TTGCTTAAAA ATCTCATGCT AAGAATTTAC TATGTTGTTA 31750 

ATAATCTTCA AGATATTTAT GAATAAAGTC TTATTTCTAA TCCTTCCTCC 31800 

AACTGTATCT GGTGCTAAAT CAGGAAATGT TTCTTCCCAA AAAGCCTCGT 31850 

GGAAGATCTG TATGTCTAAA TATATGTCAG GGATAATACA GATGTAGCCC 31900 

TGCGAAGCAT GACCTTGATT TTTATAGTCT AAAATGTCAT TTGCAGATAT 31950 

CTATTTTCTA AGAATAATTC CTAAAAGAAT TATTTGAATG TTGTAGGAAA 32000 

GCTAAGAAAT TTTGCAAAGA GCGTACGTGA AAATATAAGC TAGGCTTTTG 32050 

TGGTTTGTGG ATAGACTTCC CAACAAAATT GCTTTTTATC TATAGTGATC 32100 

CAAGCTTGTG GAACATATTA GTCATCTTTT TTTAGAAAAT TCTTAGAAAA 32150 

GTGATCTTGC AAAAATGGAA TTTATCTTTC CCCAAGTATA TTCTGTCATG 32200 

TATAGAGTTA AACTAAGCAT AGTAATTTCA CCAGACAAAC ATTCAAAATC 32250 

TACTCCTGAC CTTTTTATCT CATCCAAATT TTCCCAGGGC CCAGACATAA 32300 

ACCTTTGCCT TACGAACTCT TTGTATATGC ACTAAATATG CTTCTCCTTC 32350 

AAGGTTCTCA GTCAGCTAGA AAAATGTGCA AGAGTAAATG GTACCCTTCT 32400 

CACTTGTAGA TCCAAGAGAA TTAGACTTAA ACTCACTCTA CATGTCTGTG 32450 

ACTTTATTTT ATTTGCATGA CAGTCCTGTG AGGTGGCAAG GCAGGTATCT 32500 

TGGATCCATT TTTTAGATAA GGAAGTTCAA ATTGAGAAGA GGTTGCATGA 32550 

TTTACAGGAA GCCATACTGT AGTCCTATGT TACTCTTAAA AATCCCATTC 32 600 

AAATCCTGCT TCTGAGGCCT GC ATACTT TC TACCCTACCA GTCATTGACC 32650 

CATGCTTATG TCTCCTTTGA AAACATTGAT TCCACTCTTG TCTCCAGTGA 32700 

AAAAGTGGAA TTTAAGCAGA GAAACAAAAG CCATTTGTCT TGTTAAGTCT 32750 

ACTTTCCCTC TACTTTCAAG AAGGAAAGTT GGGGTATGTG TTGAATGGTG 32800 

ATTTATTTAT TTATTTATTA TTTTAAAAAT TGATACAAGG TCTTACTGTA 32850 

TTGTGCAGGC TGGTCTCAAA CTCCTGGGCT CAAGTGATCA TCCCACCTCA 32 900 

GCCTCCCAGT GTTGGGATTA CAGCATGAAC CATTGTGCCC ACCACCGATC 32 950 

CGCAGTTTTT TAAGAAAAAC TTTTACTATA GAAAATTTTA ATCATATACA 33000 

AAATACAGAG GAAAGT AT AT GAACCCACTT TAGGAGACTA GAATATGCCA 33050 

CCCCAAAATA TGCCACTTTG GCATAAGGAT TATTTCGAGC TAAAGGCAAC 33100 

TGGGAAGAAA CACATAGAAG AAAAGTTCTC TGTCCTTCTC CATTTGCCTA 33150 

AAAGCAGGAC ATGAATCTTA AAAGTCCCCC TCCTTCCCTT TCTACCAGGA 33200 

AAAACAAGAG TTAATCACTG AAGATAACTT CAGACCCTTA TCAGTGTAGA 33250 

GATGGCACTA GAAGAATCTA TATTACATAC TCATTTATTT TCCTTCCCAC 33300 

AACTTGCCAC CCCAGAGACT AAAAATCCTT TTCCTTTGTC ATGTCTCTTG 33350 

TCCAAAAATT TGCTCTATAA GCTGGAGTTC TAAGCCACCT CTTTGAGAAT 33400 

TACTTGTTCC CTGGTATTTT CTGTTAACAT ACATGTATTA ATATACATGT 33450 

TAACAAGCTT CTGTTTGTTT TTCTCCTGTT TTCTGTCTTG TTACAGAGGT 33500 

CCATCCCAAC TAAGAACTAA AGAGTAGGAG GAAAATATAA TTTCCTCCTG 33550 

CATACTTTGA TCTTGTTTAA TCCGTAACCC TTCCCACTTT TCACCTCCTA 33600 

CCTATTAGAT TACTTTGAAG CAAATTTCAG ATATATTACT TTATCTATAA 33 650 

ATATTTCAGT ATGTGCTAGG TGTGGTGGCT CACACCTGTA ATCCCAACAC 33700 

TTTGGGAAGC TGAGGCAGGA GGATCACTTG AGCCCAGGAG TTCAAGACCA 33750 

GCTACGGCAA CAAAAAATCA AAAACTTATC TGGGCATGGT GGCACATGCC 33800 

TGTGGTCCCA GCTACATGAG AGGCTGAGGC AGGAGGATCG CTTTAGCCCA 33850 

GGAGGTTGAG GCTGCAGTAA GCTGCATTCA CACCACTGCA CTCCAGCCTG 33900 

GGTGACAGAG TAAGACCATG TCTCAAAAAA ATACATATTT TAGTATGTAT 33950 

CCTTTTTGTA AAAACACAAT ACTTTTATCA TACTTTAAAT AATAACAATA 34000 

ATTCCTTAGT ATCACCAAAT ATTTTGTCAG TGTCTCACAT TTTCCTTATT 34050 

GTCTAAAATA TTGTTGATAG TTATTCAAAT CAGAATCCAA ACAAGGTCCA 34100 

TATATTACAT TTGGTTGACA AGTCTCTTAA GTTTGTTCAT CTTTAAGTTC 34150 

TTCCTCCCTC TCTTTCATCT CTTGTAATTT ATTAATGTGA AAAAACAGGT 34200 

AATTTGTTCT ATAGTATTTC CTACATTATA GAGTTTGCTA CATTTATTCC 34250 

C TAT GAT AT C ATTTAGCATG TTCCTCTGTC CCCTGTGTTT CCTGTAAACT 34300 

GGTAGTTATA CCTAGAAGCT TGAGTTTATT CAGGTTTTTA ATTGTATTTT 34350 

TTTTGCAAGA ATTCTTTATT ATCTGCTTCT GGAAGCACAG AATGTCTGGT 34 4 00 

TGTGTCTGGT TTTGATCTTG ACAGCTACTG ATGACCATTG CCTAATCCAT 34450 

TACTTTATTG GGGTGGGGGG AATAAGGTTT TAAAATAAAT TTTTTTTAAA 34500 

GATTTTTTTA ACTGTTATTT TGAGACAGTG TCTCATTTCG TTTCCCAGGC 34550 
TGGAGTGCAG TGGCACAATC ACGGCTCACT GCAGCCTTGA CCTCCTGGGA 34 600 
TCAGGTGATC TTCTCACCTC AGCCTCCTGG GTACCTGGAA CTACAGGTGC 34 650 
ACACCACCAC ACCTGGCTAA TTTTTTGTAT TTTGTGTACA GAAGGGGTTT 34700 
CATCATGTTT CCCAGACTGG TCTTGAACTC CTGGGTTCAA GTGATCTACC 347 50 
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CACTTCAGCT TCCCAAAATC CTGGGATTAC ACTTTGGCCA CCGTGCCTGG 34800 

CCTAAATGAA ATTATTTGTC TCTAAACAGA CAGAAGTTTT ACTTTAAAAA 34850 

TTTGTCTTTG TGTGTACATG TGTTTGTGTA TGTGTGTGTG TCTAAAAGTT 34 900 

TGGCTTTGAG CTTTGCTTTG AATTCTTGGA TGAACAATAA CCAAGAATAC 34 950 

TTAAACTCTG ATCATTCTTG ACAGATATCC CCTACAGGCT ATGGCCTTTT 35000 

GAATTGTGTC CTCCAGTGAT AAAAAGCAGC AAGCACGATA CTGCTCTCAG 35050 

ATTCATGGTG GTCACATGTG AGGTGAAAAA AAAAAAAAAG ATGAATCCTA 35100 

TTTAAATGCC CCCAGGATAA CAGTGATACT CTTTGTAGGA TAACTATTTG 35150 

CTTGCCACTG GTTTCATTAA ATAAGGACAT AAGTAAAGAT CTATTTTTGT 35200 

CTCTTTCTCC CCAACCACCA CAACTAGGAT TATTGGCTAT CTCTTCTGTT 35250 

CAAGAAATTG GTGGGCACCA AGGTGTTAAT GGCAAGCGTG CAAGGTTCAA 35300 

AGAGAAGGAA GCTTCGAGTA TACCTTCATT GCACAAACAC TGACAAGTAA 35350 

GTATGAAACA CACCCTTTAC CAATCATCAA GTTTTAGTGG GTAAGCCTGT 35400 

AACTTTACTC AAACACCCTG TTGCATGTGT CTATACATTG CATAAGTATA 35450 

GGCAGTTGCA ATTTAGTAAA GTTTTATACA ACGATTTTAT TTTATTTTAT 35500 

TTTTAGAAGA AAAATGCTAC TTTTGTTGTT GTTGTTTTTT GAGACGGGGC 35550 

CTCGCTCGTC ACCCAGGCTG GAGTGCAGTG GTGCAATCTC AGCTCACTGC 35600 

AACCTCCGCC TCCCGGGTTC AAGTGATTCT TGAAGAGGAG AACAATAATA 35650 

ACAACAATAT TATTTTCAAA AGTTGTGACC GCAGTTTCTG GAGT TGAGAA 35700 

GACATCGAGA TTTTTGTAGC CTCATACTCT TGCTTTAGGT AGCAAAAAAT 35750 

GTTCCTAAAT CTCAGGAATA TTCTCTAGAT AGGTTTCAAT CTATCATTCC 35800 

TGATAAGATG ATGCTGAAAT ACTAATTCTA GCCAAAAAAG ACCAGCTACC 35850 

ATTTCCGATT GTTGGGGACT GGGAACTCTG GATAGTGAGG ACCCCAGTAG 35900 

GAAGTAGCGA GGGGAATGGT TTGAATGGAT AAATTCATAA AAAATGTCAG 35950 

TAGATTTAAT TTTCTTATAC ATTTCAGTCT TTTTATAAGG CTAGGAAAAG 3 6000 

CCCCTGTTTT TATGGTTTAT AATTTGAATT CACATGAACC CACAAAATTT 3 6050 

GCCTTTTACC TTCCTATGTC TGAAAATGGA TAGTCTGGCT GGCCTCTTAA 3 6100 

CAACCCAGCT GGCAGAGCTG TGAGGATCTC AGTGTGCTCT AGCCCAGACA 3 6150 

TTGGTAGCAT GAACGGCAAC ATTTTTAATT GTGTTTTCAA AATAGGAGCA 3 620 0 

CACTAGCGGT CTAAAACGAT CATAAAAGAA GGATACTAAG AGGGCCCACT 36250 

GTCATTATGG ATCCTAATAC TTAGGATGCA TTATGGATTG TCATTATGGA 36300 

T AC TAATACT TAGGATCACA TTTGTAATTG AGTTTTTAAT TGCTTAAATT 36350 

AGATACATAT TTCTATTAAG TTAACCTCTT TGCTTTTAGT CCAAGGTATA 36400 

AAGAAGGAGA TTTAACTCTG TATGCCATAA ACCTCCATAA TGTCACCAAG 36450 

TACTTGCGGT TACCCTATCC TTTTTCTAAC AAGCAAGTGG ATAAATACCT 36500 

TCTAAGACCT TTGGGACCTC ATGGATTACT TTCCAAGTAA GTAATTTTCC 36550 

TTGTTCATTC CAAACTTTCA ATAAATTTAT TGGTGTTTAT CAGAATAGAG 36600 

AGTTTGGACA GGGAGCAAAA GACAAAGTCA ACTATATCAA GTTCTAATAA 36650 

TTCTTAATAT TCAGGAAATT TATGTATGAA TACTTACTAA TATGAGTATA 36700 

ACTCATCCTA AGAGTCTAAA GCAAAAGGAT GTGAACACAA ACTAGCAGTT 36750 

ATCTTAGAGA ATAAGTTTGC ATTTCAAAAT AACTTGACAT ATCAAGATCC 36800 

ACTCAACGCA TTTAAATTAT TTACTCTAAA AAGACATAAT TCTTGGTAAC 36850 

ACATTCACTA AAGCAAAATA TACCTTTATA TAATTGCTAT CAAAGGTATG 3 6900 

TGGGTTGGTA TAAAATATCA TACCATGTGA GATCAGTGTG ATTCCTTTAC 36950 

AGCATTAATT TTTATTGGTT AGAGTAAGAA AAAGAATAGC TAGAGTATAT 37000 

TTCTTAAGTA GATTCTCATA CACTTTGGTT TCAAAAACCA ATTATTGACT 37050 

ACATCTTATA AAAGCCTGTA TTCAATGGAG TGCCAAAAAA TGACTATGAG 37100 

TCTTAAAGAG T TAGGCAT AT AAATATTTTA AGGTTTCTGT TCAATGTATG 37150 

TTGGAAGGAG TTCCTTTCTC ATGACTATTC TCATATTGGA GCATAAAAAG 37200 

AGTTTACAGG CTTGGCGCAG TGGCTCATGC CTGTAATCCC AATACTTTGG 37250 

GAAGCTGAAG CAGGCAGATC ACTTCAGCCC AGGAGTTTGA GACCAGCCTG 37300 

GGCAATATGG CAAAACTCTC TCTACAAAAT ATACCAAAAT TAGCCAGGCG 37350 

TGGTGGTGCA TGCCTGTAGT CCCAGCTACT TGGGAAGCTG AGGTGGGAGG 37 4 00 

ATTGCTTGAG CCCAGGGGGG TCATGGCTGC AGTGAGCTGT GATGGTGCCT 37450 

CTGTCACCCA GCCTGGGTGA CAGAGTGAGA CCCTGTCTCA AAAAAATAAA 37500 

TAAATAAAAA TTAAGAGTTT AC AAAATT CT CACCATCTCC TCCCATCTTT 37550 

GCAAATGCCA CATAAGTGAT GTGTTCCAGG ACTATTAGCC TCGGAACCTG 37 600 

AGGCAGTACA GTAAGCACGC TTTCTCCAAA GTCCTGTCCC CCACAGACAA 37 650 

ACATTATTTA CACTGGGTAC TGCTCTTTTA TTTTTTCCCC TCTATGCTTT 37700 

ATTTTACTAT AACTATAATC ATATAACATG TAATAGGAAA AAGGCAGGGT 37750 

CGGGGGAGAG ATCCAGAAGT CTTCCCAAGA GCCTTTCCAA CATAGCCTCT 37800 

GTAGACATTT TTTCTTTCTT CTTTTTXTTT TTTTTTTTTT TTCTGAGACA 37850 

GAGTCTCACT CTGTTGTCCA GGCTAGAGTG CAGTGGCGTG ATCTAGGCTC 37900 

ACTGCAACCT CCGCCTCCTG GGTTCAAGCA ATTCTCCCAC CTCAGCCTCC 37 950 

CTAGTAGCTG GGATTAGAGG CATGCATCAC CACGCCTGGC TAATTTTTGT 38000 

ATTTTTAGTA GAGATGAGGT TTCACCATGT GGGCCAGGCT GGTCTTGAAC 38050 

TCCTGACCTC AAGTGATCCA CCTGCCTTAG CCTCCCAAAG TGCTAGGATT 38100 

ACACGAGTGA GCCACCGTGC CCTGCCCCTA TTACATTCTG ATCACACATT 38150 

TCATGTTTTA TAATTGGAAA ACTGGTGAAA TTATAGACAA TGTTTTGTTC 38200 

CCCTAAATTC TCTTTGATGA GTATATATTA CTTACACTCT TCTGTCTTTA 38250 

AAATTTTGCA AAATAGTATC CTAGATAAGT TTATGAGTGC ACAGTCTGTA 38300 

CGCTTACTCA TATTAATGAC CTCGGAGAGT TAAACAACAG TCACCTTTAA 38350 

AAATTATTAC TATCATTATC ATTATTTTTG AGGCGGGGGT CTCATTCTGT 38 400 

CTCCCAGGCT GGAGAGTAGT GGTGCGGTCA CAGCTCACTG CAGCCACCGC 38450 

TACCTGGGCT CAAGTGATCC TTCCTCCTCA GCCTTCTGAG TAGCTGAGAC 38500 

CACAGGCTTA TGCTACCACA CCTGGCTAAT TTTTTAACTT TTTGTAGAGA 38550 

CGATGTCTCA TTATGTTGCC CAGGCTGGTC TCAAACTCCT AAGCTCAAGT 38600 

GATCTTCCTC AGCCTCCCAA AGTGCTGGGA TTACAGGCAT GAAAAACTGC 38650 

ACCCAGCCCT AAAAAT T AT T AGGGTCCTGC ATAGTAAGAC TTTAATAAAT 38700 

ATTTAAATGA ACATCTGGTT TTTTTAAAAA AAAAATAGAG ACAAGGTCTC 38750 
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ACTATATTGC CCAAGCTGGT CTCGAACTCC TGGACTCACG CAATCCTGCT 38800 

GCCTTAGCCG CCCAAAGTGC TGGGATTACA GGCATGACCC ACCTCATCTG 38850 

GGCTGAGTGA ACATATTTTT AACATAAAGG CCGTATTTTA TATTTATCTC 38 900 

ATACATTTTG CCCAGCATCC CCATTTCCGC CGAATCTGTT GCTTGCTAAT 38 950 

TCCTTCCAGC TTCATTTCAT CTGAAATTTG ACAAACATCT TCTATTTCTT 39000 

TGTCGTCATG TTATTGACTT CAGAATATAA AATAAAACAC TATACCCAAA 39050 

TTAAACCCCA CCCTCATTGC CCAGCCTGAT GTGAAAATAA TCAGCATACA 39100 

TTAAGCTTAC CCTTGATATA TGTGTAGCAT CTTTTAGATA AATATACAGC 3 9150 

TGATTAAGCA ATATAGCCTG ATGGTATAAT ATCTTGCCCA TGTACCTCAT 3 9200 

CTTATCTCCA GCAGGATTAA TTCACAGTGA TCAGATTTAC CTTTAAACTT 3 9250 

TGTAGCAAAA TATCCTCTCC AAAAGCATAT CTAAAACTTT TGTGTGTACT 3 9300 

CTTGCAAGTT TCTTAATTTC ATGCAGAACA GGCTCTTACC ACTGTTAGCT 39350 

GGAGATATTT TCAAGACCTA TTTTTGTTTG TGGTTTCCTG ATGATGGTCA 3 9400 

TGGCATTTCC CCCTTCACTC CATCTAAAAA TTGAGGTGAT ACAGGCTTTT 39450 

AAACAAAACC AACTCATATA GACTGAGTAC AACTGCAATG CAGGCATGCT 39500 

AACCTCTGCT ACAATCATGG GCGTGCTATT GATATGTCTT AAGTTACAGA 39550 

ACACAGGGCT GAGCGTCTCA TTAGGTCAAA ATGTAAACCA GTTTTTCTGC 39600 

TCACTGATGC TTAATGAGGA CAGGGTGTGA GAGATTTCTT TAAGGAAAAC 3965 0 

AAATATATAA TAATGCTACA TGGAAAAATA TCTAACATTA GAGAATTAAG 3 9700 

TAAATAAACT AATATACTCA CACCATGGAA TCTTGTGCAG ACATTAAAAT 39750 

TATGTAGTGG ATGGATGTTT AATGGTGTGA GAAAAAGTTA GGATGTGCTG 3 9800 

GGGTGGGGGG AAGAATCAAG TTTTAAGAAA ATACAGTATA CCCATACTTA 39850 

AGTAAAAAAA AAAAAAAAGG TATGTACAGT CATGTGTTGC TTAATGATGG 3 9900 

GGATACATTC CGAGAAATGT GTCGATAGGT GATTTCATCC TTGTGTGAAC 39950 

ATCATAGAGT GAACTTACAC AAACCTAGAT GGTCTAGCCT ACTATGTATC 40000 

TAGGCTATAT GACTAGCCTG TTGCTCCTAG GCTACAAACC TGTAAAGCAT 40050 

GTTACTGTAG CGAATATACA AATACTTAAC ACAATGGCAA GCTATCATTG 40100 

TGTTAAGTAG TTGTGTATCT AAACATATCT AAAACATAGA AAACTAATGT 40150 

GTTGTGCTAC AATGTTACAA TGACTATGAC ATTGCTAGGC AATAGGAATT 4 0200 

ATAATTTTAT CCTTTTATGG AACCACACTT ATATATGCGG TCCATGGTGG 4 0250 

ACCAAAACAT CCTTATGTGG CATATGACTG TATACATGTA CACAAAAAAT 40300 

AGATGAAAGA ATGAATATAC ATCAAAATAT TTAAAATGGT TATAATGACT 40350 

TAGGTTACTT TTATTTATCT TAGTAATAAT AATGATGATA GATAATACTT 40400 

TTATAGTGTT TACTATATAA AAGACACTGT TATAAGTGTT CTACATACTT 40450 

TACATGTATT ACCTAAATGA TATAAATATA ACTCTGACAG TAACTAATCT 40500 

TATACGTTCT CTTTTCTTTT TTTTTTTTTT CTTTTTTTAG ACAGAATCTT 40550 

GCTCTACCAG GCTGGAGTGC AGGGTGCAAT CTCGGCTCAC TGCAACCTCC 40600 

GCCTCCCAGG TTCAAACGAT TCTCATGTCT CAGCCTCCTG AGTAGCTGGG 40650 

ACTACAGGCA CACACCACCA TGCCCGGCTA ATTTTTGTAT TTTTGGGTAG 40700 

AGATGGAGTT TTGCCATGTT GGCCAGGCTG ATCTTGAACT CCTGGCCTCA 40750 

AGTGATCTGC CTGCCTCAGC CTCCCAAAGT GCTGGGATTA CAGGTGTGAA 408 00 

CCACTGTGCT CGGCCTAATC TTACAAGTTT TCAATATTTA AAGAGTGCTA 40850 

ACTTTGTTGA CAATATAAAA CATATTTGAG AAAAAGAGAT ATAAGCATCT 40900 

TATTTAGAAT TATGAAAATA TCAATAGACC TACAGCCGAC TAAAGCTTTT 40950 

CTTCATAAGC TCTTGCCTAT ATTGATTCGC TCCTGTGAAT ATGCATTAAT 41000 

TTGATTTAAA TAATAAGTAT GTATAAGAAA TAACACTTTT CCTTAATTTT 41050 

TAAGAACGTT CAACAGTTTT TAATTTGAAT TCCAATAGTG AAATACATAG 41100 

AAAATATAAA ATTTTCTGTA GTTTAGCCAA ATTGTTTTTG TTTCACCACA 41150 

GCATTCTACC AAAATTTCTT AATAACAGTA AGAAAAT GAA TGCATACCTC 41200 

CTGCAGGGAG AGGGGAGTTA GGCAGTTTAT GGGCATAGTT ACAAGTGAGA 41250 

AATTTCATTG GCTACCATTT ACGCTAAATT CATAAAAACT GCATTCAATT 41300 

CTATATATCT ATTTTCTTTA CATAAAAAAG GTTTCAATTA TTGGCCATTA 41350 

AATAAAATAG CCACCATTCC AGAAGTTGTG TCATGTTTAT CCTTTTTATA 41400 

CCACCATCAT ATTGCCTATT AT ATAGAT T G TGTGTGTTCC ATTTTCTGTA 41450 

ATGGGCCAGA CAGTAAGTAT TTCTGGCTTT GGAGTCCATA TGGTCTCTAT 41500 

CATAACTACT CATCTCTGCC ATTGTAGCTT AAAGATTATC TAGGTCAAAT 41550 

GCCTAAGTGA TATAGTGTTG AAATACAAGT TATATAATAT AGGCTGCCAC 41600 

AAAAAAAAAT TTATTTGGTC TAAAAAAGAT TTCATGACTT TTGTAGCAGC 41650 

ATGGGTGGGG CATGCACCAC TTGGTTAACT CGGTGTATCT TTCTCCTTTG 41700 

CAGATCTGTC CAACTCAATG GTCTAACTCT AAAGATGGTG GATGATCAAA 41750 

CCTTGCCACC TTTAATGGAA AAACCTCTCC GGCCAGGAAG TTCACTGGGC 41800 

TTGCCAGCTT TCTCATATAG TTTTTTTGTG AT AAGAAAT G CCAAAGTTGC 41850 

TGCTTGCATC TGAAAATAAA ATATACTAGT CCTGACACTG AATTTTTCAA 41900 

GTATACTAAG AGTAAAGCAA CTCAAGTTAT AGGAAAGGAA GCAGATACCT 41950 

TGCAAAGCAA CTAGTGGGTG CTTGAGAGAC ACTGGGACAC TGTCAGTGCT 42000 

AGATTTAGCA CAGTATTTTG ATCTCGCTAG GTAGAACACT GCTAATAATA 42050 

ATAGCTAATA ATACCTTGTT CCAAATACTG CTTAGCATTT TGCATGTTTT 42100 

ACTTTTATCT AAAGTTTTGT TTTGTTTTAT TATTTATTTA TTTATTTATT 42150 

TTGAGACAGA ATCTCTCTCT GTCACCCAGG CTGGAGTGCC ATGGTGCGAT 42200 

CTTGGCTCAC TGCAACTTTA AGCAATTCTC CTGCCTCAGC TTCCTGAGTA 42250 

GCTGGGATTA TAGGCGTGTG CCACCACGCC CAGCTACTTT CTATATTTTT 42300 

TGTAGAGATG GAGTTTCGCC ATATTGGCCA AGCTGGTCTC GAACTCCTGT 42350 

CCTCGAACTC CTGTCCTCAA GTGATCCACC CGCCTCAGCC TCTCAAAGTG 42400 

CTGGGATTAC AGGTGTGAGC CACCACACCC AGCAGTGTTT TATTTTTGAG 42450 

ACAGGGTATC ATTCTGTTGC CCAGGCTTGA GTGCAGTGGT GCAATCATAG 42500 

ATCACTGCAG CCTTTTAACT CCTGGGCTCA AGTCATCCTC CTGCTTAGCC 42550 

TCCCAAGTAG CTAGGACCAC AGACACATGC CATCACACTT GGCTATTTTT 42600 

AAAAAATTTT TTGTAGAGAT GGGGTCTCGC TATGTTACCC AAACTGGTCC 42 650 

TGAACTCCTG GACTCAATTG ATCCTCCCAC CTTGGCCTTC CAGGTGCTGG 42700 

GATTTCTTTG GGAGTACAGC ATGGTACAGC AGGAGATCAT TTGATGTTAC 42750 



CTCTGTGCAG TGTTGCTAGT CAGCGAAAGA 
GCGATTAGCC ACCACAACCA GTCTTTATTT 
GGCGCAGTGG CTCACACCTG TAATCCTAGC 
ATGGATCACC TGACGTGAGG AA TTTGAGAC 
AACCCCATCT CTACTAAAAA ATACAAAAAT 
GTCCCAGCTA CTTGGGAGGC TGGGGCAGGA 
GCAGAGGTTG CAGTGAGCCG AGATTGTGCC 
ACAGAGAGAG ATTCCATCTC AAAAAAACAA 
ATGCTCCTAA TATGGTCAGG AAGCAAGGAA 
TTTAAGAAGG TGCTTAGCTG TATATTTATC 
TTTTAGAATT CTTTCCTTCA TGTGCCATCT 
AAGCATACTG CCGTTACCGT GAAACTGGTT 
TTGCACCTTA AAAGACAGCT AGATTTTGCT 
CTTTGTCAGC AATAATATGT GAGAGGACAG 
AAAAAATGGT TAATGACAAT TCAGAGGCGA 
ATTACTATAA ATGAAATTGA TTTGTCAAGA 
CCAATACCTT ATAACTGTCT GTTAATGCTT 
CCTTGTTTCA GTTGGGAAGC TTTTGGCTGC 
TCAAATGGCT TAAGCAATAA GGAAATGTAT 
TCAAACAGGC CAGGCTCCAG CACTTCAGTA 
CTTCCCAGCT CTCTGCTCTG CCATCTTTAG 
TCTGGTAGCA TGATGGCTGT AGCTGTTTCA 
AGCAACCAGA GGAAGAAAAT GAGCCATTTT 
TGAATAACTC TTTTTCAGAG CTTCTCACAG 
CTCATGTCTT ATTGTTCAGA AATGGGTAAT 
TGCCAACAAC AACGAGGTTC CTATAATTGT 
TGGAGAGGGT GTTGGTCAGT CTACAAACTG 
TTTACCAGTG AAAAAATGTA ATTATTTTCC 
TTCAAATGTA TGCCTGTTAT GGATATAGTA 
AATAGCTTTA GGGGTACACA CTTTTTGCTT 
GGTGAAGACT CGGCTTTTAA TGTACTTGTC 
ACCCAATAGG TAATTTTTCA TCCATTACCC 
CTGAGTCTCC AACATCCCTT ATACCACTGT 
AGCTAAGCTT CCACTTATAA GT GAGAAC AT 
CCTGAGTTAC TTCCCTTAGG ATAACAGCCC 
GCAAAATACA TTATTCTTCT TTATGGCTGA 
TATACCACAT TTTCTTTATC CACTTATCAG 
TTCCATTCAA TTTCATTCAA TTTAAGTATA 
AAAATTAAAT TTTAGATCTT TCAATACTCT 
TTTTTATATT TTCACATTTG AAATAAAGTA 
GTATGACTAT TCTTTTAGTA ATGTAAAGCC 
ACCACTAGTG TGTTGTTTCA CCCCTTGTTA 
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U 1 A I AA 1 ACO 




42 800 


AAAbj 1 1A1 1 A 




4 2850 


AC 111 bbjbrAbi 






CAGCCTGGCC 






1 AGG1 bbbi G 


1 Gbj 1 bb 1 \3 i A 


'iOUUU 


GAATTACTTG 




/! rj fief) 


ACTGCAG ICC 


AGGb. 1 baUbribi 


H O 1U U 


Gl 1 Al 1 AAAA 


AlbrlAIAl bjA 




GC G AAGG AT A 


T AT T ATG AGT 




TTTCAAAATG 


IA1 1 ftbnHuH 


j j u 


CTAC A GGC AC 


Cb-A 1 UAbrAAA 




G i AAAAGAUA 




*± 0 o 0 V 


GATTTTCTTC 


TTTCGGTTTT 


T J T U U 


A X 1 G 1 i Abj A 1 


A 1 LjA 1 Alj 1 .H. ± 




GG AGA 1 1 L, 1 Lj 


qi7i 7\ 7\ r-nrpT, 7\ 7\ 


43500 


r , f*'AT'A 7\ 7\ T 1 rrirn 

bbAlAAAl i 1 


1 Ao/ii^J^rt.^^i^-' 


43550 


GCTTTTTCTC 




1 jOUU 


A A l~" r P A A f~* A f"* 7\ 

AAbi i AAb-Al^A 




43650 


Ai ICGCAGA1 


AAL. 1 HbiiLu 1 




CG 1 CACOAbjbj 


LjA lLl Ijvto I 1 




GGG1 GGG llL 


A I I b* 1 t^AbiAb-- 


4 JO v U 


i GGGGGGG i 1 


LAAAbb 1 ^A 1 


4 jOjU 


TTGAGTCTCC 


i 1 G A 1 AbjAb. I 




CAAACCTCTC 


CTCATGTCTC 


4 jyou 


GTGGCCATTT 


CACGAbi 1 bAb 




CTCTGAGTAA 


1-1 /-1 .-i rp rp rp f~- f 71 7\ 

GGG 111 bjbrAA 




AACACTGCAG 


TTCTGCGCTT 


44100 


CCTCTTAAGG 


ATTAATATTC 


4 4 lOU 


TCTTTAAAAT 


TTTTTATTTT 


4 4 Z U U 


ACAGGGGTGA 


ATTGTGTAGT 


44Z OU 


ACCTGAGTGA 


TGTACATTGT 


44300 


TCCTTCCGCC 


CTCTTCCCTT 


4 4ooU 


GTATGTTCTT 


GT GTACCTAC 


/1 a a n n 
4 34UU 


GCAGTATTTG 


GTTTTCCATT 


yi /l ^ t; n 


CCAGTTC CGT 


bbAAb 1 1 brC 1 


4 4 ouu 


GTAATAGTCC 


AT GGTAC AT A 


44550 


T T G AT GG AC A 


CTTAGGTTAA 


44500 


TTTGTAAGGA 


GCTAAAGCTG 


44650 


TAAATTTTAT 


ATGTAAGTGG 


44700 


ATTTTTATAA 


CCTTGATATT 


44750 


TACAGACTCC 


TACATTTGGA 


44800 


TACTATCAGG 


ATCCTCGA 


44848 



(2) INFORMATION FOR SEQ ID NO:43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2396 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43 

TTTCTAGTTG CTTTTAGCCA ATGTCGGATC AGGTTTTTCA AGCGACAAAG 50 

AGATACTGAG ATCCTGGGCA GAGGACATCC TAGCTCGGTC AGATTTGGGC 100 

AGGCTCAAGT GACCAGTGTC TTAAGGCAGA AGGGAGTCGG GGTAGGGTCT 150 

GGCTGAACCC TCAACCGGGG CTTTTAACTC AGGGTCTAGT CCTGGCGCCA 200 

AATGGATGGG ACCTAGAAAA GGTGACAGAG TGCGCAGGAC ACCAGGAAGC 250 

TGGTCCCACC CCTGCGCGGC TCCCGGGCGC TCCCTCCCCA GGCCTCCGAG 300 

GATCTTGGAT TCTGGCCACC TCCGCACCCT TTGGATGGGT GTGGATGATT 350 

TCAAAAGTGG ACGTGACCGC GGCGGAGGGG AAAGCCAGCA CGGAAAT GAA 400 

AGAGAGC GAG GAGGGGAGGG CGGGGAGGGG AGGGCGCTAG GGAGGGACTC 450 

CCGGGAGGGG TGGGAGGGAT GGAGCGCTGT GGGAGGGTAC TGAGTCCTGG 500 

CGCCAGAGGC GAAGCAGGAC CGGTTGCAGG GGGCTTGAGC CAGCGCGCCG 550 

GCTGCCCCAG CTCTCCCGGC AGCGGGCGGT CCAGCCAGGT GGGATGCTGA 600 

GGCTGCTGCT GCTGTGGCTC TGGGGGCCGC TCGGTGCCCT GGCCCAGGGC 650 

GCCCCCGCGG GGACCGCGCC GACCGACGAC GTGGTAGACT TGGAGTTTTA 700 

CACCAAGCGG CCGCTCCGAA GCGTGAGTCC CTCGTTCCTG TCCATCACCA 750 

TCGACGCCAG CCTGGCCACC GACCCGCGCT TCCTCACCTT CCTGGGCTCT 800 

CCAAGGCTCC GTGCTCTGGC TAGAGGCTTA TCTCCTGCAT ACTTGAGATT 850 

TGGCGGCACA AAGACTGACT TCCTTATTTT TGATCCGGAC AAGGAACCGA 900 

CTTCCGAAGA AAGAAGTTAC TGGAAATCTC AAGTCAACCA TGATATTTGC 950 



AGGTCTGAGC CGGTCTCTGC TGCGGTGTTG 
GCCCTTCCAG GAGCTGTTGC TGCTCCGAGA 
AGAACAGCAC CTACTCAAGA AGCTCAGTGG 
AAGTGCTCGG GGTTAGACCT GATCTTTGGT 
CCCAGACTTA CGGTGGAACA GcTCCAACGC 
GCTCTTCCAA GGGTTATAAC ATcTCCTGGG 
AGTTTcTGGA AGAAAGCTCA CATTCTCATC 
AGACTTTGTG GAGTTGCATA AACTTcTACA 
CAAAACTCTA TGGTCCTGAC ATCGGTCAGC 
CTGCTGAGGA GTTTCCTGAA GGCTGGCGGA 
ATGGCATCAC TATTACTTGA ATGGACGCAT 
TGAGCTCTGA TGCGCTGGAC ACTTTTATTC 
AAGGTCACTA AAGAGATCAC ACCTGGCAAG 
GAGCTCAGCT TACGGTGGCG GTGCACCCTT 
CTGGCTTTAT GTGGCTGGAT AAATTGGGCC 
GAAGTCGTGA TGAGGCAGGT GTTCTTCGGA 
GG AT GAAAAC TTTGAGCCTT TACCTGATTA 
AGAAACTGGT AGGTCCCAGG GTGTTACTGT 
AGGAGCAAAC TCCGAGTGTA TCTCCACTGC 
ATATCAGGAA GGAGATCTAA CTCTGTATGT 
CCAAGCACTT GAAGGTACCG CCTCCGTTGT 
TACCTTCTGA AGCCTTCGGG GCCGGATGGA 
ACTGAACGGT CAAATTCTGA AGATGGTGGA 
TGACAGAAAA ACCTCTCCCC GCAGGAAGTG 
TCCTATGGTT TTTTTGTCAT AAGAAATGCC 
AAAATAAAAG GCATACGGTA CCCCTGAGAC 
TTCATAAAAC AAAACCCTAG TTTAGGAGGC 
GAGCTTCGGG AGGGTGGGGT ACACTTCAGT 
CTCTCTAAGA AGAATACTGC AGGTGGTGAC 
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AGGAAACTCC 


AGGTGGAATG 


1000 


GCAGTACCAA 


AAGGAGTTCA 


1050 


ACATGCTCTA 


CAGTTTTGCC 


1100 


CTAAATGCGT 


TACTACGAAC 


1150 


CCAGCTTCTC 


CTTGACTACT 


1200 


AACTGGGCAA 


TGAGCCCAAC 


1250 


GATGGGTTGC 


AGTTAGGAGA 


1300 


AAGGTCAGCT 


TTCCAAAATG 


1350 


CTCGAGGGAA 


GACAGTTAAA 


1400 


GAAGTGATCG 


ACTCTCTTAC 


1450 


CGCTACCAAA 


GAAGATTTTC 


1500 


TCTCTGTGCA 


AAAAATTCTG 


1550 


AAGGTCTGGT 


TGGGAGAGAC 


1600 


GCTGTCCAAC 


ACCTTTGCAG 


1 650 


TGTCAGCCCA 


GATGGGCATA 


1700 


GCAGGCAACT 


ACCACTTAGT 


1750 


CTGGCTCTCT 


CTTCTGTTCA 


1800 


CAAGAGTGAA 


AGGCCCAGAC 


1850 


ACTAACGTCT 


ATCACCCACG 


1900 


CCTGAACCTC 


CATAATGTCA 


1950 


TCAGGAAACC 


AGTGGATACG 


2000 


TTACTTTCCA 


AATCTGTCCA 


2050 


TGAGCAGACC 


CTGCCAGCTT 


2100 


CACTAAGCCT 


GCCTGCCTTT 


2150 


AAAATCGCTG 


CTTGTATATG 


2200 


AAAAGCCGAG 


GGGGGTGTTA 


2250 


CACCTCCTTG 


CCGAGTTCCA 


2300 


ATTACATTCA 


GTGTGGTGTT 


2350 


AGTTAATAGC 


ACTGTG 


2396 



(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 535 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:44 
Met Leu Arg Leu Leu Leu Leu Trp Leu Trp Gly Pro Leu Gly Ala 
5 10 15 

Leu Ala Gin Gly Ala Pro Ala Gly Thr Ala Pro Thr Asp Asp Val 
20 25 30 

Val Asp Leu Glu Phe Tyr Thr Lys Arg Pro Leu Arg Ser Val Ser 
35 40 45 

Pro Ser Phe Leu Ser lie Thr lie Asp Ala Ser Leu Ala Thr Asp 
50 55 60 

Pro Arg Phe Leu Thr Phe Leu Gly Ser Pro Arg Leu Arg Ala Leu 
65 70 75 

Ala Arg Gly Leu Ser Pro Ala Tyr Leu Arg Phe Gly Gly Thr Lys 
80 85 90 

Thr Asp Phe Leu lie Phe Asp Pro Asp Lys Glu Pro Thr Ser Glu 
95 100 105 

Glu Arg Ser Tyr Trp Lys Ser Gin Val Asn His Asp lie Cys Arg 
110 115 120 

Ser Glu Pro Val Ser Ala Ala Val Leu Arg Lys Leu Gin Val Glu 
125 130 135 

Trp Pro Phe Gin Glu Leu Leu Leu Leu Arg Glu Gin Tyr Gin Lys 
140 145 150 
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Glu Phe Lys Asn Ser Thr Tyr Ser Arg Ser Ser Val Asp Met Leu 
155 160 165 

Tyr Ser Phe Ala Lys Cys Ser Gly Leu Asp Leu lie Phe Gly Leu 
170 175 180 

Asn Ala Leu Leu Arg Thr Pro Asp Leu Arg Trp Asn Ser Ser Asn 
185 190 195 

Ala Gin Leu Leu Leu Asp Tyr Cys Ser Ser Lys Gly Tyr Asn He 
200 205 210 

Ser Trp Glu Leu Gly Asn Glu Pro Asn Ser Phe Trp Lys Lys Ala 
215 220 225 

His He Leu lie Asp Gly Leu Gin Leu Gly Glu Asp Phe Val Glu 
230 235 240 

Leu His Lys Leu Leu Gin Arg Ser Ala Phe Gin Asn Ala Lys Leu 
245 250 255 

Tyr Gly Pro Asp He Gly Gin Pro Arg Gly Lys Thr Val Lys Leu 
260 265 270 

Leu Arg Ser Phe Leu Lys Ala Gly Gly Glu Val He Asp Ser Leu 
275 280 285 

Thr Trp His His Tyr Tyr Leu Asn Gly Arg lie Ala Thr Lys Glu 
290 295 300 

Asp Phe Leu Ser Ser Asp Ala Leu Asp Thr Phe He Leu Ser Val 
305 310 315 

Gin Lys He Leu Lys Val Thr Lys Glu He Thr Pro Gly Lys Lys 
320 325 330 

Val Trp Leu Gly Glu Thr Ser Ser Ala Tyr Gly Gly Gly Ala Pro 
335 340 345 

Leu Leu Ser Asn Thr Phe Ala Ala Gly Phe Met Trp Leu Asp Lys 
350 355 360 

Leu Gly Leu Ser Ala Gin Met Gly He Glu Val Val Met Arg Gin 
365 370 375 

Val Phe Phe Gly Ala Gly Asn Tyr His Leu Val Asp Glu Asn Phe 
380 385 390 

Glu Pro Leu Pro Asp Tyr Trp Leu Ser Leu Leu Phe Lys Lys Leu 
395 400 405 

Val Gly Pro Arg Val Leu Leu Ser Arg Val Lys Gly Pro Asp Arg 
410 415 420 

Ser Lys Leu Arg Val Tyr Leu His Cys Thr Asn Val Tyr His Pro 
425 430 435 

Arg Tyr Gin Glu Gly Asp Leu Thr Leu Tyr Val Leu Asn Leu His 
440 445 450 

Asn Val Thr Lys His Leu Lys Val Pro Pro Pro Leu Phe Arg Lys 
455 460 465 

Pro Val Asp Thr Tyr Leu Leu Lys Pro Ser Gly Pro Asp Gly Leu 
470 475 480 

Leu Ser Lys Ser Val Gin Leu Asn Gly Gin He Leu Lys Met Val 
485 490 495 

Asp Glu Gin Thr Leu Pro Ala Leu Thr Glu Lys Pro Leu Pro Ala 
500 505 510 

Gly Ser Ala Leu Ser Leu Pro Ala Phe Ser Tyr Gly Phe Phe Val 
515 520 525 

He Arg Asn Ala Lys He Ala Ala Cys He 
530 535 

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2396 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45 

TT TCT AGT 8 

TGC TTT TAG CCA ATG TCG GAT CAG GTT TTT CAA GCG ACA AAG AGA 53 

TAC TGA GAT CCT GGG CAG AGG ACA TCC TAG CTC GGT CAG ATT TGG 98 

GCA GGC TCA AGT GAC CAG TGT CTT AAG GCA GAA GGG AGT CGG GGT 143 

AGG GTC TGG CTG AAC CCT CAA CCG GGG CTT TTA ACT CAG GGT CTA 188 

GTC CTG GCG CCA AAT GGA TGG GAC CTA GAA AAG GTG ACA GAG TGC 233 

GCA GGA CAC CAG GAA GCT GGT CCC ACC CCT GCG CGG CTC CCG GGC 278 

GCT CCC TCC CCA GGC CTC CGA GGA TCT TGG ATT CTG GCC ACC TCC 323 

GCA CCC TTT GGA TGG GTG TGG ATG ATT TCA AAA GTG GAC GTG ACC 368 

GCG GCG GAG GGG AAA GCC AGC ACG GAA ATG AAA GAG AGC GAG GAG 413 

GGG AGG GCG GGG AGG GGA GGG CGC TAG GGA GGG ACT CCC GGG AGG 458 

GGT GGG AGG GAT GGA GCG CTG TGG GAG GGT ACT GAG TCC TGG CGC 503 

CAG AGG CGA AGC AGG ACC GGT TGC AGG GGG CTT GAG CCA GCG CGC 54 8 

CGG CTG CCC CAG CTC TCC CGG CAG CGG GCG GTC CAG CCA GGT GGG 593 

ATG CTG AGG CTG CTG CTG CTG TGG CTC TGG GGG CCG CTC GGT GCC 638 
Met Leu Arg Leu Leu Leu Leu Trp Leu Trp Gly Pro Leu Gly Ala 
5 10 15 

CTG GCC CAG GGC GCC CCC GCG GGG ACC GCG CCG ACC GAC GAC GTG 683 
Leu Ala Gin Gly Ala Pro Ala Gly Thr Ala Pro Thr Asp Asp Val 
20 25 30 

GTA GAC TTG GAG TTT TAC ACC AAG CGG CCG CTC CGA AGC GTG AGT 728 
Val Asp Leu Glu Phe Tyr Thr Lys Arg Pro Leu Arg Ser Val Ser 
35 40 45 

CCC TCG TTC CTG TCC ATC ACC ATC GAC GCC AGC CTG GCC ACC GAC 773 
Pro Ser Phe Leu Ser lie Thr lie Asp Ala Ser Leu Ala Thr Asp 
50 55 60 

CCG CGC TTC CTC ACC TTC CTG GGC TCT CCA AGG CTC CGT GCT CTG 818 
Pro Arg Phe Leu Thr Phe Leu Gly Ser Pro Arg Leu Arg Ala Leu 
65 70 75 

GCT AGA GGC TTA TCT CCT GCA TAC TTG AGA TTT GGC GGC ACA AAG 863 
Ala Arg Gly Leu Ser Pro Ala Tyr Leu Arg Phe Gly Gly Thr Lys 
80 85 90 

ACT GAC TTC CTT ATT TTT GAT CCG GAC AAG GAA CCG ACT TCC GAA 908 

Thr Asp Phe Leu lie Phe Asp Pro Asp Lys Glu Pro Thr Ser Glu 
95 100 105 

GAA AGA AGT TAC TGG AAA TCT CAA GTC AAC CAT GAT ATT TGC AGG 953 
Glu Arg Ser Tyr Trp Lys Ser Gin Val Asn His Asp lie Cys Arg 
110 115 120 



TCT GAG CCG GTC TCT GCT GCG GTG TTG AGG AAA CTC CAG GTG GAA 
Ser Glu Pro Val Ser Ala Ala Val Leu Arg Lys Leu Gin Val Glu 
125 130 135 



998 



TGG CCC TTC CAG GAG CTG TTG CTG CTC CGA GAG CAG TAC CAA AAG 
Trp Pro Phe Gin Glu Leu Leu Leu Leu Arg Glu Gin Tyr Gin Lys 
140 145 150 



1043 
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GAG TTC AAG AAC AGC ACC TAC TCA AGA AGC TCA GTG GAC ATG CTC 1088 
Glu Phe Lys Asn Ser Thr Tyr Ser Arg Ser Ser Val Asp Met Leu 
155 160 1S5 

TAC AGT TTT GCC AAG TGC TCG GGG TTA GAC CTG ATC TTT GGT CTA 1133 
Tyr Ser Phe Ala Lys Cys Ser Gly Leu Asp Leu He Phe Gly Leu 
170 175 180 

AAT GCG TTA CTA CGA ACC CCA GAC TTA CGG TGG AAC AGc TCC AAC 1178 
Asn Ala Leu Leu Arg Thr Pro Asp Leu Arg Trp Asn Ser Ser Asn 
185 190 195 

GCC CAG CTT CTC CTT GAC TAC TGC TCT TCC AAG GGT TAT AAC ATc 1223 
Ala Gin Leu Leu Leu Asp Tyr Cys Ser Ser Lys Gly Tyr Asn He 
200 205 210 

TCC TGG GAA CTG GGC AAT GAG CCC AAC AGT TTc TGG AAG AAA GCT 12 68 
Ser Trp Glu Leu Gly Asn Glu Pro Asn Ser Phe Trp Lys Lys Ala 
215 220 225 

CAC ATT CTC ATC GAT GGG TTG CAG TTA GGA GAA GAC TTT GTG GAG 1313 
His He Leu He Asp Gly Leu Gin Leu Gly Glu Asp Phe Val Glu 
230 235 240 

TTG CAT AAA CTT CTA CAA AGG TCA GCT TTC CAA AAT GCA AAA CTC 1358 
Leu His Lys Leu Leu Gin Arg Ser Ala Phe Gin Asn Ala Lys Leu 
245 250 255 

TAT GGT CCT GAC ATC GGT CAG CCT CGA GGG AAG ACA GTT AAA CTG 1403 
Tyr Gly Pro Asp He Gly Gin Pro Arg Gly Lys Thr Val Lys Leu 
260 265 270 

CTG AGG AGT TTC CTG AAG GCT GGC GGA GAA GTG ATC GAC TCT CTT 1448 
Leu Arg Ser Phe Leu Lys Ala Gly Gly Glu Val He Asp Ser Leu 
275 280 285 

ACA TGG CAT CAC TAT TAC TTG AAT GGA CGC ATC GCT ACC AAA GAA 1493 
Thr Trp His His Tyr Tyr Leu Asn Gly Arg He Ala Thr Lys Glu 
290 295 300 

GAT TTT CTG AGC TCT GAT GCG CTG GAC ACT TTT ATT CTC TCT GTG 1538 
Asp Phe Leu Ser Ser Asp Ala Leu Asp Thr Phe He Leu Ser Val 
305 310 315 

CAA AAA ATT CTG AAG GTC ACT AAA GAG ATC ACA CCT GGC AAG AAG 1583 
Gin Lys He Leu Lys Val Thr Lys Glu He Thr Pro Gly Lys Lys 
320 325 330 

GTC TGG TTG GGA GAG ACG AGC TCA GCT TAC GGT GGC GGT GCA CCC 1628 
Val Trp Leu Gly Glu Thr Ser Ser Ala Tyr Gly Gly Gly Ala Pro 
335 340 345 



TTG CTG TCC AAC ACC TTT GCA GCT GGC TTT ATG TGG CTG GAT AAA 
Leu Leu Ser Asn Thr Phe Ala Ala Gly Phe Met Trp Leu Asp Lys 
350 355 360 



1673 
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TTG GGC CTG TCA GCC CAG ATG GGC ATA GAA GTC GTG ATG AGG CAG 1718 
Leu Gly Leu Ser Ala Gin Met Gly He Glu Val Val Met Arg Gin 
365 370 375 

GTG TTC TTC GGA GCA GGC AAC TAC CAC TTA GTG GAT GAA AAC TTT 17 63 
Val Phe Phe Gly Ala Gly Asn Tyr His Leu Val Asp Glu Asn Phe 
380 385 390 

GAG CCT TTA CCT GAT TAC TGG CTC TCT CTT CTG TTC AAG AAA CTG 1808 
GLu Pro Leu Pro Asp Tyr Trp Leu Ser Leu Leu Phe Lys Lys Leu 
395 400 405 

GTA GGT CCC AGG GTG TTA CTG TCA AGA GTG AAA GGC CCA GAC AGG 1853 
Val Gly Pro Arg Val Leu Leu Ser Arg Val Lys Gly Pro Asp Arg 
410 415 420 

AGC AAA CTC CGA GTG TAT CTC CAC TGC ACT AAC GTC TAT CAC CCA 18 98 
Ser Lys Leu Arg Val Tyr Leu His Cys Thr Asn Val Tyr His Pro 
425 430 435 

CGA TAT CAG GAA GGA GAT CTA ACT CTG TAT GTC CTG AAC CTC CAT 1943 
Arg Tyr Gin Glu Gly Asp Leu Thr Leu Tyr Val Leu Asn Leu His 
440 445 450 

AAT GTC ACC AAG CAC TTG AAG GTA CCG CCT CCG TTG TTC AGG AAA 1988 
Asn Val Thr Lys His Leu Lys Val Pro Pro Pro Leu Phe Arg Lys 
455 460 465 

CCA GTG GAT ACG TAC CTT CTG AAG CCT TCG GGG CCG GAT GGA TTA 2 033 
Pro Val Asp Thr Tyr Leu Leu Lys Pro Ser Gly Pro Asp Gly Leu 
470 475 480 

CTT TCC AAA TCT GTC CAA CTG AAC GGT CAA ATT CTG AAG ATG GTG 2 078 
Leu Ser Lys Ser Val Gin Leu Asn Gly Gin He Leu Lys Met Val 
485 490 495 

GAT GAG CAG ACC CTG CCA GCT TTG ACA GAA AAA CCT CTC CCC GCA 2123 
Asp Glu Gin Thr Leu Pro Ala Leu Thr Glu Lys Pro Leu Pro Ala 
500 505 510 

GGA AGT GCA CTA AGC CTG CCT GCC TTT TCC TAT GGT TTT TTT GTC 2168 
Gly Ser Ala Leu Ser Leu Pro Ala Phe Ser Tyr Gly Phe Phe Val 
515 520 525 

ATA AGA AAT GCC AAA ATC GCT GCT TGT ATA TGA AAA TAA AAG GCA 2213 
He Arg Asn Ala Lys He Ala Ala Cys He 
530 535 

TAC GGT ACC CCT GAG ACA AAA GCC GAG GGG GGT GTT ATT CAT AAA 2258 

ACA AAA CCC TAG TTT AGG AGG CCA CCT CCT TGC CGA GTT CCA GAG 2303 

CTT CGG GAG GGT GGG GTA CAC TTC AGT ATT ACA TTC AGT GTG GTG 2348 

TTC TCT CTA AGA AGA ATA CTG CAG GTG GTG ACA GTT AAT AGC ACT 2393 

GTG 2396 



(2) 



INFORMATION FOR SEQ ID NO: 46: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 385 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 46 
CGGCCGCTGC TGCTGCTGTG GCTCTGGGGG CGGCTCCGTG CCCTGACCCA 50 
AGGCACTCCG GCGGGGACCG CGCCGACCAA AGACGTGGTG GACTTGGAGT 100 
TTTACACCAA GAGGCTATTC CAAAGCGTGA GTCCCTCGTT CCTGTCCATC 150 
ACCATCGACG CCAGTCTGGC CACCGACCCT CGGTTCCTCA CCTTCCTGAG 200 
CTCTCCACGG CTTCGAGCCC TGTCTAGAGG CTTATCTCCT GCGTACTTGA 250 
GATTTGGCGG CACCAAGACT GACTTCCTTA TTTTTGATCC CAACAACGAA 300 
CCCACCTCTG AAGAAAGAAG TTACTGGCAA TCTCAAGACA ACAATGATAT 350 
TTGCGGGTCT GACCGGGTCT CCGCTGACGT GTTGA 385 

(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 541 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47 

AAATCAGGAC ATATCCTTCA CTTATTTGCC TCTTGGTCAT ATTGGAGGCA 50 

TTTGTATTCA TTTTTAATAA CCCTCAAAAT AGTGCATGCA AAGTGCTAAG 100 

CGTCATTTGC CACATGGTGC CATTAACTGT CACCACCTGC AGTGGTCTAC 150 

TTAGAGAACA CCGCACTGGA TGTTAACACT GAAGCGCGTG CCCCGCCCTC 200 

CCGAGGCTCT GGATCCAGCG TTGAAGCTTG CCCCGCCCTC CCGAGGCTCT 250 

GGATCCAGCA CTGGAGCATG CCCCGCCCTC CCGAGGCTCT GGAGCTTGCT 300 

AAGGAGTCCG CTCCCTACCG CTGGGGTTTT GCTTTATTCT TATGAATGAC 350 

ACCCCTGACC GCTTTCGTCT CAGGGGTACT GTAATGCCTT TTATTTTCAT 400 

ATACAAGCTG CGATTTTGGC ATTTCTTATG ACAAAAAACC CATAGGAAAA 450 

GGCGGGCACG CTTAGTGAGC TTCCTGCGGG GAGAGGTTTT TCTGTTAGAG 500 

CTGGCANGGT CTGCTCATCG ACCATCTTCA GGCCTCGTGC C 541 



